Decision Tree Algorithm Considering Distances Between Classes

نویسندگان

چکیده

Decision tree algorithm (DT) is a commonly used data mining method for classification and regression. DT repeatedly divides dataset into pure subsets based on impurity measurements such as entropy Gini. Then relatively “pure” partitions consisting of observations with the (almost) same class are obtained. Gini index one representative indices measuring data. However, does not take account distances between classes. If classes considered when impurity, decision can distinguish clearly different To end, new Rao-Stirling proposed considering considers in way that weights more to pairs references distant impurity. Experimental results indicate superior terms accuracy, implying help improve accuracy DT.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On Distances between Isomorphism Classes of Graphs

Various measures of distance between (the isomorphism classes of) two graphs have been proposed. These include measures proposed by Zelinka [6],[7], Baláž, Koča, Kvasńıčka & Sekanina [1] and Johnson [4]. Further, there is the edge rotation distance of Chartrand, Saba & Zou [3] and the edge shift distance of Johnson [5]. It is on the latter two that we build. For terminology and notation not exp...

متن کامل

A hybrid genetic algorithm / decision tree approach for coping with unbalanced classes

This paper proposes a new approach for coping with the problem of unbalanced classes, where some class(es) is(are) much less frequent than the other(s). The proposed approach is a hybrid genetic algorithm / decision tree system. The genetic algorithm acts as a wrapper, using the output of a decision tree algorithm (the state-of-the-art C5.0) to compute the fitness of population individuals (can...

متن کامل

Structure and Majority Classes in Decision Tree Learning

To provide good classification accuracy on unseen examples, a decision tree, learned by an algorithm such as ID3, must have sufficient structure and also identify the correct majority class in each of its leaves. If there are inadequacies in respect of either of these, the tree will have a percentage classification rate below that of the maximum possible for the domain, namely (100 Bayes error ...

متن کامل

Global Tree Optimization: A Non-greedy Decision Tree Algorithm

A non-greedy approach for constructing globally optimal multivariate decision trees with xed structure is proposed. Previous greedy tree construction algorithms are locally optimal in that they optimize some splitting criterion at each decision node, typically one node at a time. In contrast, global tree optimization explicitly considers all decisions in the tree concurrently. An iterative line...

متن کامل

CUDT: A CUDA Based Decision Tree Algorithm

Decision tree is one of the famous classification methods in data mining. Many researches have been proposed, which were focusing on improving the performance of decision tree. However, those algorithms are developed and run on traditional distributed systems. Obviously the latency could not be improved while processing huge data generated by ubiquitous sensing node in the era without new techn...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2022

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2022.3187172